State-Conditioned Adversarial Subgoal Generation

نویسندگان

چکیده

Hierarchical reinforcement learning (HRL) proposes to solve difficult tasks by performing decision-making and control at successively higher levels of temporal abstraction. However, off-policy HRL often suffers from the problem a non-stationary high-level policy since low-level is constantly changing. In this paper, we propose novel approach for mitigating non-stationarity adversarially enforcing generate subgoals compatible with current instantiation policy. practice, adversarial implemented training simple state conditioned discriminator network concurrently which determines compatibility level subgoals. Comparison state-of-the-art algorithms shows that our improves both efficiency performance in challenging continuous tasks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Neural Subgoal Generation using Backpropagation

Building a world model takes exponential computational costs with the number of obstacles. In real world applications are usually many obstacles, possibly changing their positions over time. In order to cope with a changing environment, a solution has to be adaptive. In order to plan complex trajectories, a system that plans hierarchically shows many advantages. In this article we report on res...

متن کامل

Feasibility Study: Subgoal Graphs on State Lattices

Search using subgoal graphs is a recent preprocessing-based path-planning algorithm that can find shortest paths on 8neighbor grids several orders of magnitude faster than A*, while requiring little preprocessing time and memory overhead. In this paper, we first generalize the ideas behind subgoal graphs to a framework that can be specialized to different types of environments (represented as w...

متن کامل

Adversarial Objectives for Text Generation

Language models can be used to generate text by iteratively sampling words conditioned on previously sampled words. In this work, we explore adversarial objectives to obtain good text generations by training a recurrent language model to keep its hidden state statistics during sampling similar to what it has sees during maximum likelihood estimation (MLE) training. We analyze the convergence of...

متن کامل

Adversarial Generation of Natural Language

Generative Adversarial Networks (GANs) have gathered a lot of attention from the computer vision community, yielding impressive results for image generation. Advances in the adversarial generation of natural language from noise however are not commensurate with the progress made in generating images, and still lag far behind likelihood based methods. In this paper, we take a step towards genera...

متن کامل

Adversarial Examples Generation and Defense Based on Generative Adversarial Network

We propose a novel generative adversarial network to generate and defend adversarial examples for deep neural networks (DNN). The adversarial stability of a network D is improved by training alternatively with an additional network G. Our experiment is carried out on MNIST, and the adversarial examples are generated in an efficient way compared with wildly-used gradient based methods. After tra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i8.26213